A unified framework for translation and understanding allowing discriminative joint decoding for multilingual speech semantic interpretation

نویسندگان

  • Bassam Jabaian
  • Fabrice Lefèvre
  • Laurent Besacier
چکیده

Probabilistic approaches are now widespread in most natural language processing applications and selection of a particular approach usually depends on the task at hand. Targeting speech semantic interpretation in a multilingual context, this paper presents a comparison between the state-of-the-art methods used for machine translation and speech understanding. This comparison justifies our proposition of a unified framework for both tasks based on a discriminative approach. We demonstrate that this framework can be used to perform a joint translation-understanding decoding which allows to combine, in the same process, translation and semantic tagging scores of a sentence. A cascade of finite-state transducers is used to compose the translation and understanding hypothesis graphs (1-bests, word graphs or confusion networks). Not only this proposition is competitive with the state-of-the-art techniques but its framework is even more attractive as it can be generalized to other components of human-machine vocal interfaces (e.g. speech recogniser) so as to allow a richer transmission of information between them.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discriminative statistical approaches for multilingual speech understanding (Approches statistiques discriminantes pour l'interprétation sémantique multilingue de la parole) [in French]

Discriminative statistical approaches for multilingual speech understanding Statistical approaches are now widespread in the various applications of natural language processing and the elicitation of an approach usually depends on the targeted task. This paper presents a comparison between the methods used for machine translation and speech understanding. This comparison allows to propose a uni...

متن کامل

A Contrastive Analysis of Persian and English Compliment, Request, and Invitation Patterns within the Semantic Metalanguage Framework

Speech acts shape the core of pragmatic competence and their mastery is a prerequisite for successful dis- course encounters in an L2. Cross-cultural comparisons, as claimed by Johnstone (2018), are very effec- tive for knowing and acquiring the speech acts. Accordingly, the purpose of this study was to compare a limited number of communicative routines in English and Persian within the f...

متن کامل

Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion

We present a discriminative structureprediction model for the letter-to-phoneme task, a crucial step in text-to-speech processing. Our method encompasses three tasks that have been previously handled separately: input segmentation, phoneme prediction, and sequence modeling. The key idea is online discriminative training, which updates parameters according to a comparison of the current system o...

متن کامل

Articles: A Framework for Fast Incremental Interpretation during Speech Decoding

This article describes a framework for incorporating referential semantic information from a world model or ontology directly into a probabilistic language model of the sort commonly used in speech recognition, where it can be probabilistically weighted together with phonological and syntactic factors as an integral part of the decoding process. Introducing world model referents into the decodi...

متن کامل

A Framework for Fast Incremental Interpretation during Speech Decoding

This paper describes a framework for incorporating referential semantic information from a world model or ontology directly into a probabilistic language model of the sort commonly used in speech recognition, where it can be probabilistically weighted together with phonological and syntactic factors as an integral part of the decoding process. Introducing world model referents into the decoding...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Speech & Language

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2016